Abstract



We examine the total number of collisions $C_n$ in the $\Lambda$-coalescent process which starts with $n$ particles. A linear growth and a stable limit law for $C_n$ are shown under the assumption of a power-like behaviour of the measure $\Lambda$ near 0 with exponent $0 < \alpha < 1$.

1 Introduction

A system of particles undergoes a random Markovian evolution according to the rules of the Pitman-Sagitov $\Lambda$-coalescent [13, 14] if the only possible type of interaction is a collision affecting two or more particles that merge together to form a single particle. When the total number of particles is $b \ge 2$, a collision affecting some $2 \le j \le b$ particles occurs at the probability rate

$$\lambda_{b,j} = \binom{b}{j}\int_0^1 x^{j-2}(1-x)^{b-j}\,\Lambda(dx), \tag{1}$$

where $\Lambda$ is a given finite measure on $[0,1]$. A linear time change allows us to rescale $\Lambda$ by its total mass, making it a probability measure, which is always assumed below. Two important special cases are Kingman's coalescent [10] with $\Lambda$ a unit mass at 0 (when only binary collisions are possible), and the Bolthausen-Sznitman coalescent [5] with $\Lambda$ the Lebesgue measure on $[0,1]$. See [H El [HI [12] for recent work on the $\Lambda$-coalescents and further references.

A quantity of considerable interest is the number of collisions $C_n$ which occur as the system progresses from the initial state with $n$ particles to the terminal state with a single particle. Representing the coalescent process by a genealogical tree, $C_n$ can also be understood as the number of non-leaf nodes. Asymptotic properties of $C_n$ are sensitive functions of the behaviour of $\Lambda$ near 0. In this paper we explore the class of measures which satisfy

$$\Lambda([0,x]) = A x^{\alpha} + O(x^{\alpha+\varsigma}) \quad\text{as } x \downarrow 0, \text{ with } 0 < \alpha < 1 \text{ and } \varsigma > 0. \tag{2}$$

Under this assumption we show that $C_n \sim (1-\alpha)n$ as $n \to \infty$ (Lemma 6) and that the law of $C_n$ approaches a completely asymmetric stable distribution of index $2-\alpha$ (Theorem 9).

The same question for the Bolthausen-Sznitman coalescent has been addressed recently in [6, 7]. This can be viewed as a limiting case of (2) with $\alpha = 1$. However, the technique of [6, 7] is based on the particular form of $\Lambda$ in that case, hence cannot be applied to the general $\Lambda$ satisfying (2) with $\alpha = 1$.

If $\Lambda$ is a beta$(\alpha, 2-\alpha)$ distribution with parameter $0 < \alpha < 2$, a time-reversal of the coalescent describes the genealogy of a continuous-state branching process [4]. This connection was exploited recently to study the small-time behaviour of $\Lambda$-coalescents [TJ [5] in the beta case.

We develop here a more robust and straightforward approach based on analysis of the decreasing Markov chain $\mathcal{M}_n$ counting the number of particles. The number of collisions $C_n$ is the number of steps needed for $\mathcal{M}_n$ to reach the absorbing state 1 from state $n$. In Kingman's case $\mathcal{M}_n$ has unit decrements, but in general the decrements of $\mathcal{M}_n$ are not stationary, which is a major source of difficulties preventing direct application of the classical renewal theorems for step distributions with infinite variance [9]. To overcome this obstacle we show that when (2) holds, in a certain range $\mathcal{M}_n$ can be bounded from above and below by processes with stationary decrements. This allows us to approximate $C_n$, and it turns out that these bounds can be made tight enough to derive the limit theorem. Our method may be of interest in the wider context of pure death processes.

* Research supported by the Netherlands Organisation for Scientific Research (NWO).

† Postal address: Department of Mathematics, Utrecht University, Postbus 80010, 3508 TA Utrecht, The Netherlands. E-mail addresses: gnedin@math.uu.nl, yakubovich@math.uu.nl.

By Schweinsberg's result [16] a coalescent satisfying (2) comes down from infinity, hence the number of particles existing at a fixed time is bounded uniformly in $n$. Therefore the asymptotics of the number of collisions that occur prior to some fixed time is the same as that of $C_n$.



2 Markov chain $\mathcal{M}_n$

Let $\mathcal{M}_n$ be the Markov chain whose time ticks at the collision events and whose state coincides with the number of remaining particles. Since no two collisions occur simultaneously, the number of collisions $C_n$ in the $\Lambda$-coalescent starting with $n$ particles is the number of steps the Markov chain $\mathcal{M}_n$ needs to proceed from the initial state $n$ to the terminal state 1. Note that the number of particles decreases by $j-1$ when a collision affects $j$ particles, hence the probability of transition from $b$ particles to $b-j+1$ is

$$q_b(j) = \frac{\lambda_{b,j}}{\lambda_b}, \qquad 2 \le j \le b, \tag{3}$$

where $\lambda_b$ is the total collision rate of $b$ particles

$$\lambda_b = \sum_{j=2}^{b}\lambda_{b,j} = \int_0^1 \frac{1-(1-x)^b - bx(1-x)^{b-1}}{x^2}\,\Lambda(dx). \tag{4}$$

It is convenient to introduce the sequence of moments

$$\nu_b := \int_0^1 (1-x)^b\,\Lambda(dx), \qquad b = 0,1,\dots \tag{5}$$

In view of $\lambda_{b,2} = \binom{b}{2}\nu_{b-2}$ the rates $\lambda_{b,2}$ ($b = 2,3,\dots$) uniquely determine the whole array $\lambda_{b,j}$, as one can also conclude from the consistency relation

$$(b+1)\lambda_{b,j} = (b+1-j)\lambda_{b+1,j} + (j+1)\lambda_{b+1,j+1},$$

which is equivalent to the integral representation of rates (1), see [13].
Simple computation shows that the rates can be derived from the $\nu$'s as

$$\lambda_{b,j} = \binom{b}{j}\sum_{i=0}^{j-2}(-1)^{j-i}\binom{j-2}{i}\nu_{b-i-2}, \tag{6}$$

and, from $\lambda_{b+1} - \lambda_b = b\,\nu_{b-1}$, we have

$$\lambda_b = \sum_{i=1}^{b-1} i\,\nu_{i-1}. \tag{7}$$

Since the second difference in $b$ of $\int_0^1 \bigl(bx + (1-x)^b - 1\bigr)x^{-2}\,\Lambda(dx)$ is $\nu_b$, it also follows that

$$\sum_{j=2}^{b}(j-1)\lambda_{b,j} = \int_0^1 \frac{bx + (1-x)^b - 1}{x^2}\,\Lambda(dx) = \sum_{i=1}^{b-1}(b-i)\,\nu_{i-1}. \tag{8}$$
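The identities (7), (8) and the consistency relation can be checked numerically for a concrete measure. The sketch below assumes, purely for illustration, that $\Lambda$ is the beta$(\alpha, 2-\alpha)$ law mentioned in the Introduction; the function names `rate`, `nu` and `check_identities` are hypothetical helpers and not part of the paper.

```python
from math import lgamma, exp

def log_beta(a, b):
    """Logarithm of the Euler beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def rate(b, j, alpha):
    """lambda_{b,j} of (1), assuming Lambda = beta(alpha, 2 - alpha)."""
    log_binom = lgamma(b + 1) - lgamma(j + 1) - lgamma(b - j + 1)
    return exp(log_binom + log_beta(j - 2 + alpha, b - j + 2 - alpha)
               - log_beta(alpha, 2 - alpha))

def nu(b, alpha):
    """Moment nu_b of (5) for the same assumed measure."""
    return exp(log_beta(alpha, b + 2 - alpha) - log_beta(alpha, 2 - alpha))

def check_identities(b, alpha):
    """Absolute discrepancies in (7), (8) and in the consistency relation."""
    lam_b = sum(rate(b, j, alpha) for j in range(2, b + 1))
    err7 = abs(lam_b - sum(i * nu(i - 1, alpha) for i in range(1, b)))
    lhs8 = sum((j - 1) * rate(b, j, alpha) for j in range(2, b + 1))
    err8 = abs(lhs8 - sum((b - i) * nu(i - 1, alpha) for i in range(1, b)))
    err_cons = max(
        abs((b + 1) * rate(b, j, alpha)
            - (b + 1 - j) * rate(b + 1, j, alpha)
            - (j + 1) * rate(b + 1, j + 1, alpha))
        for j in range(2, b + 1))
    return err7, err8, err_cons

if __name__ == "__main__":
    print(check_identities(30, 0.5))
```

All three discrepancies should be at the level of floating-point rounding, since the identities are exact for any probability measure $\Lambda$.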

We shall denote by $J_b$ a random variable with distribution

$$\mathbb{P}(J_b = j) = q_b(j),$$

so the first decrement of $\mathcal{M}_n$ is distributed as $J_n - 1$, and its mean value is

$$\mathbb{E}[J_n - 1] = \sum_{j=2}^{n}(j-1)q_n(j) = \frac{\sum_{i=1}^{n-1}(n-i)\,\nu_{i-1}}{\sum_{i=1}^{n-1} i\,\nu_{i-1}} = \frac{n\sum_{i=1}^{n-1}\nu_{i-1}}{\lambda_n} - 1. \tag{9}$$

Let $g(n,b)$ denote the Green kernel equal to the probability that the Markov chain $\mathcal{M}_n$ ever visits state $b$. We have $g(n,n) = 1$, and $g(n,1) = 1$ since 1 is the absorbing state reached in at most $n-1$ steps. Decomposition over the first jump shows that the Green kernel satisfies the recursion

$$g(n,b) = \sum_{j=2}^{n-b+1} q_n(j)\,g(n-j+1,b), \qquad n > b \ge 1. \tag{10}$$

The moments of $C_n$ can be readily expressed in terms of the Green kernel.

Lemma 1. The first two moments of the number of collisions in the $\Lambda$-coalescent started with $n$ particles are

$$\mathbb{E}[C_n] = \sum_{b=2}^{n} g(n,b), \tag{11}$$

$$\mathbb{E}[C_n^2] = \sum_{b=2}^{n} g(n,b)\Bigl(1 + 2\sum_{j=2}^{b-1} g(b,j)\Bigr). \tag{12}$$

Proof. Formula (11) is obvious since the number of collisions is the number of sites $b > 1$ visited by $\mathcal{M}_n$. Still, it is instructive to derive (11) from the first-step decomposition

$$C_n \stackrel{d}{=} 1 + C_{n-J_n+1}, \tag{13}$$

where in the RHS $J_n$ has distribution $q_n$ and is independent of $C_1, \dots, C_n$, with $C_1 = 0$. Taking expectations on both sides of (13) we obtain

$$\mathbb{E}[C_n] = h_n + \sum_{j=2}^{n} q_n(j)\,\mathbb{E}[C_{n-j+1}], \tag{14}$$

where $h_n = 1$. Replace now repeatedly $\mathbb{E}[C_{n-1}]$, $\mathbb{E}[C_{n-2}]$, etc. using this recursion. Collecting coefficients at $h_b$ we see that it is the sum over all decreasing paths from $n$ to $b$ of probabilities of the path, that is $g(n,b)$ by definition. Using relations $g(n,n) = 1$ and $\mathbb{E}[C_1] = 0$ we arrive at (11).
To calculate the second moment the equation (13) is squared, so

$$C_n^2 \stackrel{d}{=} 1 + 2C_{n-J_n+1} + (C_{n-J_n+1})^2,$$

from which

$$\mathbb{E}[C_n^2] = 1 + 2\sum_{j=2}^{n} q_n(j)\,\mathbb{E}[C_{n-j+1}] + \sum_{j=2}^{n} q_n(j)\,\mathbb{E}[C_{n-j+1}^2]. \tag{15}$$

This has the same structure as (14) with $h_n = 1 + 2\sum_{j=2}^{n} q_n(j)\,\mathbb{E}[C_{n-j+1}]$. Expressing $\mathbb{E}[C_{n-j+1}]$ from (11) and using recursion (10) yield

$$h_n = 1 + 2\sum_{j=2}^{n} q_n(j)\sum_{b=2}^{n-j+1} g(n-j+1,b) = 1 + 2\sum_{b=2}^{n-1}\sum_{j=2}^{n-b+1} q_n(j)\,g(n-j+1,b) = 1 + 2\sum_{b=2}^{n-1} g(n,b).$$

Following the same line as above we get (12). □
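Lemma 1 lends itself to a direct numerical cross-check. The following sketch computes the Green kernel through recursion (10), evaluates (11) and (12), and compares them with the first-step recursions (14) and (15). It assumes, for illustration only, that $\Lambda$ is beta$(\alpha, 2-\alpha)$; all function names are hypothetical.

```python
from math import lgamma, exp

def transition_probs(n, alpha):
    """q_n(j), j = 2..n, assuming Lambda = beta(alpha, 2 - alpha).

    The normalising constant B(alpha, 2 - alpha) cancels in the ratio (3).
    """
    weights = [exp(lgamma(n + 1) - lgamma(j + 1) - lgamma(n - j + 1)
                   + lgamma(j - 2 + alpha) + lgamma(n - j + 2 - alpha)
                   - lgamma(n))
               for j in range(2, n + 1)]
    total = sum(weights)
    return {j: w / total for j, w in zip(range(2, n + 1), weights)}

def green_kernel(N, alpha):
    """g[n][b] for 1 <= b <= n <= N via recursion (10)."""
    q = {n: transition_probs(n, alpha) for n in range(2, N + 1)}
    g = [[0.0] * (N + 1) for _ in range(N + 1)]
    for n in range(1, N + 1):
        g[n][n] = 1.0
        for b in range(1, n):
            g[n][b] = sum(q[n][j] * g[n - j + 1][b]
                          for j in range(2, n - b + 2))
    return g, q

def moments_via_green(g, n):
    """First two moments of C_n from (11) and (12)."""
    m1 = sum(g[n][b] for b in range(2, n + 1))
    m2 = sum(g[n][b] * (1 + 2 * sum(g[b][j] for j in range(2, b)))
             for b in range(2, n + 1))
    return m1, m2

def moments_via_recursion(q, N):
    """First two moments of C_n, n <= N, from recursions (14) and (15)."""
    m1 = [0.0] * (N + 1)
    m2 = [0.0] * (N + 1)
    for n in range(2, N + 1):
        s1 = sum(p * m1[n - j + 1] for j, p in q[n].items())
        s2 = sum(p * m2[n - j + 1] for j, p in q[n].items())
        m1[n] = 1 + s1
        m2[n] = 1 + 2 * s1 + s2
    return m1, m2

if __name__ == "__main__":
    g, q = green_kernel(40, 0.5)
    print(moments_via_green(g, 40))
    r1, r2 = moments_via_recursion(q, 40)
    print(r1[40], r2[40])
```

The two routes should agree up to rounding, which illustrates that (11) and (12) are just the unrolled forms of (14) and (15).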
3 Asymptotics of the moments



From now on we only consider measures $\Lambda$ satisfying (2). Standard Tauberian arguments show that in this case

$$\nu_n = A\,\Gamma(\alpha+1)\,n^{-\alpha} + O(n^{-\alpha-\varsigma'}), \qquad n \to \infty. \tag{16}$$

Here and henceforth

$$\varsigma' = \min\{\varsigma, 1\}.$$

This behaviour will imply that the transition probabilities $q_n(j)$ stabilise as $n \to \infty$ for each fixed $j$. The relevant asymptotics of $\lambda_n$ and $\lambda_{n,j}$ appeared in [2, Lemma 4] under a less restrictive assumption of regular variation, but we need to explicitly control the error term.

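Lemma 2. Suppose $\Lambda$ satisfies (2). Then for $n$ sufficiently large

$$\left|\sum_{j=m}^{n}\lambda_{n,j} - \frac{A\alpha}{2-\alpha}\,\frac{\Gamma(m+\alpha-2)}{\Gamma(m)}\,n^{2-\alpha}\right| < c\,\frac{\Gamma(m+\alpha+\varsigma'-2)}{\Gamma(m)}\,n^{2-\alpha-\varsigma'}$$

uniformly in $m = 2, \dots, n$.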

Proof. Introduce the truncated moment

$$G_{-2}(x) = \int_x^1 y^{-2}\,\Lambda(dy).$$

Integrating by parts we derive from (2) that for $x \to 0$

$$G_{-2}(x) = \frac{A\alpha}{2-\alpha}\,x^{\alpha-2} + O\bigl(\max\{x^{\alpha+\varsigma-2}, 1\}\bigr).$$

Rewriting (1) in terms of $G_{-2}$ and integrating by parts we obtain

$$\sum_{j=m}^{n}\lambda_{n,j} = -\int_0^1 \sum_{j=m}^{n}\binom{n}{j}x^{j}(1-x)^{n-j}\,dG_{-2}(x) = m\binom{n}{m}\int_0^1 x^{m-1}(1-x)^{n-m}\,G_{-2}(x)\,dx,$$

because the sum telescopes and the integrated terms vanish. Plugging the expansion of $G_{-2}$ gives

$$\left|\sum_{j=m}^{n}\lambda_{n,j} - \frac{A\alpha}{2-\alpha}\,\frac{\Gamma(m+\alpha-2)\,\Gamma(n+1)}{\Gamma(m)\,\Gamma(n+\alpha-1)}\right| < c\,\frac{\Gamma(m+\alpha+\varsigma-2)\,\Gamma(n+1)}{\Gamma(m)\,\Gamma(n+\alpha+\varsigma-1)}$$

for $\varsigma < 2-\alpha$, in which case the result follows from the familiar asymptotics of the gamma function $\Gamma(n+\beta)/\Gamma(n) = n^{\beta} + O(n^{\beta-1})$ ($n \to \infty$). If $\varsigma > 1$ the error term in this expansion constitutes the main part of error, yielding appearance of $\varsigma'$ instead of $\varsigma$. The case $\varsigma \ge 2-\alpha$ is treated in the same way. □
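As a rough numerical illustration of Lemma 2, the sketch below compares the tail sums of rates with the leading term of the lemma. It assumes $\Lambda$ is beta$(\alpha, 2-\alpha)$, for which (2) holds with $A = 1/(\alpha\,\Gamma(\alpha)\Gamma(2-\alpha))$ and $\varsigma = 1$; this choice and the function names are illustrative assumptions.

```python
from math import lgamma, exp, log

def tail_rate(n, m, alpha):
    """sum_{j=m}^{n} lambda_{n,j}, assuming Lambda = beta(alpha, 2 - alpha)."""
    log_norm = lgamma(alpha) + lgamma(2 - alpha)  # log B(alpha, 2 - alpha)
    return sum(
        exp(lgamma(n + 1) - lgamma(j + 1) - lgamma(n - j + 1)
            + lgamma(j - 2 + alpha) + lgamma(n - j + 2 - alpha) - lgamma(n)
            - log_norm)
        for j in range(m, n + 1))

def lemma2_main_term(n, m, alpha):
    """A*alpha/(2-alpha) * Gamma(m+alpha-2)/Gamma(m) * n^(2-alpha)."""
    log_A = -(log(alpha) + lgamma(alpha) + lgamma(2 - alpha))
    return exp(log_A + log(alpha) - log(2 - alpha)
               + lgamma(m + alpha - 2) - lgamma(m) + (2 - alpha) * log(n))

if __name__ == "__main__":
    for m in (2, 5, 20):
        ratio = tail_rate(2000, m, 0.5) / lemma2_main_term(2000, m, 0.5)
        print(m, ratio)
```

The ratios should be close to 1, with a deviation that grows with $m$ for fixed $n$, in line with the $m$-dependent error bound of the lemma.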

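Corollary 3. If measure $\Lambda$ satisfies (2) then as $n \to \infty$

$$\lambda_n = \frac{A\,\Gamma(\alpha+1)}{2-\alpha}\,n^{2-\alpha} + O(n^{2-\alpha-\varsigma'}), \qquad \lambda_{n,j} = \frac{A\alpha\,\Gamma(j+\alpha-2)}{j!}\,n^{2-\alpha} + O(n^{2-\alpha-\varsigma'}), \tag{17}$$

$$q_n(j) = (2-\alpha)\,\frac{\Gamma(j+\alpha-2)}{\Gamma(\alpha)\,j!} + O(n^{-\varsigma'})$$

for every fixed $j$.

Proof. The formula for $\lambda_n$ follows by the direct application of Lemma 2 with $m = 2$. The expression for $\lambda_{n,j}$ is a difference between two subsequent tail sums. The ratio of these quantities gives $q_n(j)$. □

Thus $J_n$ converge in distribution. The convergence in mean is also true. Note that the mean of the limiting distribution of jumps $J_b - 1$ is

$$\sum_{j=2}^{\infty}(j-1)(2-\alpha)\,\frac{\Gamma(j+\alpha-2)}{\Gamma(\alpha)\,j!} = \frac{1}{1-\alpha}. \tag{18}$$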
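The limiting jump law of Corollary 3 and its mean (18) can be examined numerically. In the sketch below the closed form for the tail, $\mathbb{P}(J \ge m) = \Gamma(m+\alpha-2)/(\Gamma(\alpha)\Gamma(m))$, is obtained by taking the ratio of the leading terms in Lemma 2; the function names are hypothetical. Because the summands in (18) decay like $j^{\alpha-2}$, the partial sums converge slowly and only approximate agreement should be expected.

```python
from math import lgamma, exp, log

def limit_pmf(j, alpha):
    """Limiting jump law p_j = (2 - alpha) Gamma(j + alpha - 2) / (Gamma(alpha) j!)."""
    return exp(log(2 - alpha) + lgamma(j + alpha - 2)
               - lgamma(alpha) - lgamma(j + 1))

def limit_tail(m, alpha):
    """P(J >= m) = Gamma(m + alpha - 2) / (Gamma(alpha) Gamma(m))."""
    return exp(lgamma(m + alpha - 2) - lgamma(alpha) - lgamma(m))

def truncated_mean(M, alpha):
    """Partial sum sum_{j=2}^{M} (j - 1) p_j of the series in (18)."""
    return sum((j - 1) * limit_pmf(j, alpha) for j in range(2, M + 1))

if __name__ == "__main__":
    alpha, M = 0.3, 200_000
    mass = sum(limit_pmf(j, alpha) for j in range(2, M + 1))
    print(mass + limit_tail(M + 1, alpha))       # total mass, should be near 1
    print(truncated_mean(M, alpha), 1 / (1 - alpha))
```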

Lemma 4. If (2) holds then the mean decrease of the number of particles after collision satisfies

$$\mathbb{E}[J_n - 1] = \frac{1}{1-\alpha} + O\bigl(n^{-\min\{1-\alpha,\ \varsigma'\}}\bigr).$$

Proof. By assumption (2) relation (16) implies the existence of constants $n_0$, $c$ such that

$$\bigl|\nu_{n-1} - A\,\Gamma(\alpha+1)\,n^{-\alpha}\bigr| < c\,n^{-\alpha-\varsigma'}, \qquad n > n_0.$$

Approximating sums by integrals yields, as $n \to \infty$,

$$\begin{aligned}
\sum_{i=1}^{n-1}\nu_{i-1} &= \sum_{i=n_0}^{n-1} A\,\Gamma(\alpha+1)\,i^{-\alpha} + O\Bigl(\sum_{i=n_0}^{n-1} i^{-\alpha-\varsigma'}\Bigr) + \sum_{i=1}^{n_0-1}\nu_{i-1} \\
&= A\,\Gamma(\alpha+1)\,n^{1-\alpha}\bigl(1 + O(1/n)\bigr)\int_{n_0/n}^{1} x^{-\alpha}\,dx + O(n^{1-\alpha-\varsigma'}) + O(1) \\
&= \frac{A\,\Gamma(\alpha+1)}{1-\alpha}\,n^{1-\alpha} + O\bigl(\max\{1, n^{1-\alpha-\varsigma'}\}\bigr)
\end{aligned}$$

by definition of $\varsigma'$. Substitution of this expression into (9) and applying Corollary 3 finishes the proof. □

Example. It is possible to choose the measure $\Lambda$ so that the decrement probabilities for $j < n$ are exactly the same as for the limiting distribution truncated at $n$, in which case the envisaged limit theorem for $C_n$ follows readily from [9]. To achieve

$$q_n(j) = (2-\alpha)\,\frac{\Gamma(j+\alpha-2)}{\Gamma(\alpha)\,j!} \quad (j = 2, \dots, n-1), \qquad q_n(n) = \sum_{j=n}^{\infty}(2-\alpha)\,\frac{\Gamma(j+\alpha-2)}{\Gamma(\alpha)\,j!} = \frac{\Gamma(n+\alpha-2)}{\Gamma(\alpha)\,(n-1)!}$$

one should take the measure

$$\Lambda(dx) = \alpha\Bigl(1-\frac{\alpha}{2}\Bigr)x^{\alpha-1}\,dx + \frac{\alpha}{2}\,\delta_1(dx),$$

which is a mixture of beta$(\alpha, 1)$ and a Dirac mass at 1. Adding $\delta_1$ does not affect $\lambda_{n,j}$ for $j < n$, so the integration in (1) yields

$$\lambda_{n,j} = \alpha\Bigl(1-\frac{\alpha}{2}\Bigr)\frac{\Gamma(j+\alpha-2)\,n!}{j!\,\Gamma(n+\alpha-1)} \quad (j = 2, \dots, n-1), \qquad \lambda_{n,n} = \Bigl(1-\frac{\alpha}{2}\Bigr)\frac{\alpha}{n+\alpha-2} + \frac{\alpha}{2}.$$

Summation (or direct integration of (4)) implies the desired expression for $q_n(j)$.

That a positive mass at 1 does not affect the asymptotics of $C_n$ is seen e.g. by observing that the probability of total collision implied by this mass is of the order smaller than $n^{-1}$, namely $q_n(n) = O(n^{\alpha-2})$. On the continuous time scale of the coalescent, the mass at 1 is responsible for the total coalescence time (independent of $n$), hence the insensitivity of the asymptotics to $\Lambda(\{1\})$ may be explained by the effect of coming down from infinity, as mentioned in the Introduction.

The example also demonstrates that taking the minimum in the error term of Lemma 4 is necessary. Indeed, $\varsigma' = 1$, however direct calculation using (18) shows that

$$\mathbb{E}[J_n - 1] = \sum_{j=2}^{n-1}(j-1)(2-\alpha)\,\frac{\Gamma(j+\alpha-2)}{\Gamma(\alpha)\,j!} + (n-1)\,\frac{\Gamma(n+\alpha-2)}{\Gamma(\alpha)\,(n-1)!} = \frac{1}{1-\alpha} - \frac{\Gamma(n+\alpha-1)}{(1-\alpha)\,\Gamma(\alpha)\,\Gamma(n)} = \frac{1}{1-\alpha} - \frac{n^{\alpha-1}}{(1-\alpha)\,\Gamma(\alpha)} + O(n^{\alpha-2}).$$

So the error term is $O(n^{\alpha-1})$, and not $O(n^{-1})$.
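The claims of the Example can be verified numerically. The sketch below builds the rates for the measure $\alpha(1-\alpha/2)x^{\alpha-1}dx + (\alpha/2)\delta_1(dx)$ by evaluating the beta integrals in (1), and compares the resulting $q_n(j)$ and mean jump with the closed forms stated above. The function names are hypothetical.

```python
from math import lgamma, exp, log

def example_q(n, alpha):
    """q_n(j) for Lambda(dx) = alpha(1 - alpha/2) x^(alpha-1) dx + (alpha/2) delta_1(dx)."""
    c = alpha * (1 - alpha / 2)
    lam = {}
    for j in range(2, n + 1):
        log_binom = lgamma(n + 1) - lgamma(j + 1) - lgamma(n - j + 1)
        # Integral of x^(j-2) (1-x)^(n-j) x^(alpha-1) over [0, 1] is B(j-2+alpha, n-j+1).
        log_b = lgamma(j - 2 + alpha) + lgamma(n - j + 1) - lgamma(n + alpha - 1)
        lam[j] = c * exp(log_binom + log_b)
    lam[n] += alpha / 2  # the atom at 1 contributes only to the total collision
    total = sum(lam.values())
    return {j: v / total for j, v in lam.items()}

def limit_pmf(j, alpha):
    """(2 - alpha) Gamma(j + alpha - 2) / (Gamma(alpha) j!)."""
    return exp(log(2 - alpha) + lgamma(j + alpha - 2)
               - lgamma(alpha) - lgamma(j + 1))

def limit_tail(m, alpha):
    """Gamma(m + alpha - 2) / (Gamma(alpha) Gamma(m))."""
    return exp(lgamma(m + alpha - 2) - lgamma(alpha) - lgamma(m))

def mean_jump_closed_form(n, alpha):
    """1/(1-alpha) - Gamma(n+alpha-1) / ((1-alpha) Gamma(alpha) Gamma(n))."""
    return (1 - exp(lgamma(n + alpha - 1) - lgamma(alpha) - lgamma(n))) / (1 - alpha)

if __name__ == "__main__":
    n, alpha = 50, 0.4
    q = example_q(n, alpha)
    print(max(abs(q[j] - limit_pmf(j, alpha)) for j in range(2, n)))
    print(abs(q[n] - limit_tail(n, alpha)))
    print(sum((j - 1) * p for j, p in q.items()), mean_jump_closed_form(n, alpha))
```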

Lemma 5. If (2) holds then there exists $\nu \in\,]0,1[$ such that $\lim g(n,k) = 1-\alpha$ as $n, k \to \infty$ in such a way that $k > n^{\nu}$ and $n - k \to \infty$.

The heuristics for this is that the jumps $J_b - 1$ become almost identically distributed for large $b$ and their mean is close to $1/(1-\alpha)$. If the distributions were indeed the same with mean $1/(1-\alpha)$ then the Lemma would follow from the renewal theorem. We postpone a rigorous proof to Section 4.

Lemma 6. If (2) holds then

$$C_n \sim (1-\alpha)\,n \qquad (n \to \infty)$$

in probability.

Proof. The argument is based on formulas of Lemma 1. Indeed, immediately from Lemma 1 and Lemma 5, $\mathbb{E}[C_n] \sim (1-\alpha)n$ as $n \to \infty$. Similarly, as $b \to \infty$, $\sum_{j=2}^{b-1} g(b,j) \sim (1-\alpha)b$, hence

$$\mathbb{E}[C_n^2] \sim \sum_{b=2}^{n} 2\,g(n,b)\,(1-\alpha)\,b \sim (1-\alpha)^2 n^2.$$

Application of Chebyshev's inequality completes the proof. □
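The linear growth in Lemma 6 can be observed on exact expected values. The sketch below evaluates $\mathbb{E}[C_n]$ through the first-step recursion (14), assuming for illustration that $\Lambda$ is beta$(\alpha, 2-\alpha)$ with $\alpha = 0.2$; the function names are hypothetical, and for moderate $n$ the ratio $\mathbb{E}[C_n]/n$ is only expected to be in the vicinity of $1-\alpha$, not equal to it.

```python
from math import lgamma, exp

def transition_probs(n, alpha):
    """q_n(j), j = 2..n, assuming Lambda = beta(alpha, 2 - alpha)."""
    weights = [exp(lgamma(n + 1) - lgamma(j + 1) - lgamma(n - j + 1)
                   + lgamma(j - 2 + alpha) + lgamma(n - j + 2 - alpha)
                   - lgamma(n))
               for j in range(2, n + 1)]
    total = sum(weights)
    return {j: w / total for j, w in zip(range(2, n + 1), weights)}

def expected_collisions(N, alpha):
    """E[C_n] for n = 0..N via recursion (14); entries 0 and 1 are zero."""
    mean = [0.0] * (N + 1)
    for n in range(2, N + 1):
        q = transition_probs(n, alpha)
        mean[n] = 1.0 + sum(p * mean[n - j + 1] for j, p in q.items())
    return mean

if __name__ == "__main__":
    alpha = 0.2
    mean = expected_collisions(400, alpha)
    for n in (50, 100, 200, 400):
        print(n, mean[n] / n, 1 - alpha)
```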

4 Stochastic bounds on the jumps 

In this section we construct stochastic bounds on the decrements $J_b - 1$ of the Markov chain $\mathcal{M}_n$ in a range $b = k, \dots, n$, to control the asymptotic behaviour of $C_n$. Specifically, we find random variables $J^+_{n\downarrow k}$ and $J^-_{n\downarrow k}$ to secure the distributional bounds

$$J^+_{n\downarrow k} \le_d J_b \le_d J^-_{n\downarrow k} \tag{19}$$

uniformly in some range $b = k, \dots, n$. Here $\le_d$ denotes the stochastic order, meaning that two random variables $X$ and $Y$ satisfy $X \le_d Y$ iff $\mathbb{P}[X \le t] \ge \mathbb{P}[Y \le t]$ for all $t$.
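For discrete distributions the stochastic order just defined can be tested by comparing cumulative distribution functions on the union of the supports. The following minimal sketch does this for probability mass functions given as dictionaries; the representation and the function names are illustrative assumptions.

```python
from itertools import accumulate

def cdf(pmf, support):
    """Cumulative sums of a pmf {value: probability} along a sorted support."""
    return list(accumulate(pmf.get(t, 0.0) for t in support))

def stochastically_smaller(pmf_x, pmf_y, tol=1e-12):
    """True if X <=_d Y, i.e. P[X <= t] >= P[Y <= t] for every t."""
    support = sorted(set(pmf_x) | set(pmf_y))
    return all(fx >= fy - tol
               for fx, fy in zip(cdf(pmf_x, support), cdf(pmf_y, support)))

if __name__ == "__main__":
    x = {2: 0.5, 3: 0.5}
    y = {2: 0.25, 3: 0.25, 5: 0.5}
    print(stochastically_smaller(x, y), stochastically_smaller(y, x))
```

Checking the inequality only at points of the joint support is enough because both distribution functions are step functions that jump only there.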

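Our approach to establishing the limit theorem for the number of collisions is based on constructing random variables $J^+_{n\downarrow k}$ and $J^-_{n\downarrow k}$ which on the one hand comply with (19) and on the other hand yield the same limit distribution of the sum of their independent copies. These two requirements point in opposite directions, forcing an adequate choice of these random variables to be a compromise. We define the distributions which depend on parameters $\beta, \gamma \in\,]0,1[$ and $\theta \in\,]\beta,1[$. The calibration of these constants will be done later. For $n > k > 0$ define

$$q^-_{n\downarrow k}(j) = \begin{cases}
\dfrac{\lambda_{n,2} - n^{-\gamma}\lambda_n(3:\lfloor n^{\beta}\rfloor) + \lambda_n(\lfloor n^{\beta}\rfloor+1:n)}{\lambda_n} - 2\max\limits_{\ell\in\{k,\dots,n\}}\dfrac{\lambda_\ell(\lfloor n^{\beta}\rfloor+1:\ell)}{\lambda_\ell}, & j = 2,\\[2ex]
\dfrac{\lambda_{n,j}(1+n^{-\gamma})}{\lambda_n}, & j = 3, \dots, \lfloor n^{\beta}\rfloor,\\[2ex]
2\max\limits_{\ell\in\{k,\dots,n\}}\dfrac{\lambda_\ell(\lfloor n^{\beta}\rfloor+1:\ell)}{\lambda_\ell} - 2\max\limits_{\ell\in\{k,\dots,n\}}\dfrac{\lambda_\ell(\lfloor n^{\theta}\rfloor+1:\ell)}{\lambda_\ell}, & j = \lfloor n^{\theta}\rfloor,\\[2ex]
2\max\limits_{\ell\in\{k,\dots,n\}}\dfrac{\lambda_\ell(\lfloor n^{\theta}\rfloor+1:\ell)}{\lambda_\ell}, & j = n,\\[2ex]
0, & \text{otherwise},
\end{cases} \tag{20}$$

$$q^+_{n\downarrow k}(j) = \begin{cases}
\dfrac{\lambda_{k,2} + n^{-\gamma}\lambda_k(3:\lfloor n^{\beta}\rfloor) + \lambda_k(\lfloor n^{\beta}\rfloor+1:k)}{\lambda_k}, & j = 2,\\[2ex]
\dfrac{\lambda_{k,j}(1-n^{-\gamma})}{\lambda_k}, & j = 3, \dots, \lfloor n^{\beta}\rfloor,\\[2ex]
0, & \text{otherwise},
\end{cases}$$

where

$$\lambda_n(m:k) = \sum_{j=m}^{k}\lambda_{n,j}.$$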

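Note that $\sum_j q^-_{n\downarrow k}(j) = \sum_j q^+_{n\downarrow k}(j) = 1$. Moreover, $q^{\pm}_{n\downarrow k}(j)$ are nonnegative for large enough $n$ and $k$. Indeed, the inequality $q^+_{n\downarrow k}(j) \ge 0$ is obvious. Lemma 2 implies that if $n$ and $k$ are large enough and $k > n^{\beta}$ then

$$\frac{c_1}{n^{\beta(2-\alpha)}} \le \frac{\lambda_\ell(\lfloor n^{\beta}\rfloor+1:\ell)}{\lambda_\ell} \le \frac{c_2}{n^{\beta(2-\alpha)}} \tag{21}$$

for some $c_2 > c_1 > 0$ uniformly in $\ell \in \{k, \dots, n\}$. Hence (21) holds for the maximum over these $\ell$ and it follows that $q^-_{n\downarrow k}(j) \ge 0$.

Hence, the quantities $q^{\pm}_{n\downarrow k}(j)$ define some probability distributions on $\mathbb{N}$, at least for large enough $n$ and $k$. Let $J^+_{n\downarrow k}$ and $J^-_{n\downarrow k}$ be random variables with these distributions, so

$$\mathbb{P}[J^+_{n\downarrow k} = j] = q^+_{n\downarrow k}(j) \quad\text{and}\quad \mathbb{P}[J^-_{n\downarrow k} = j] = q^-_{n\downarrow k}(j). \tag{22}$$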
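Lemma 7. Suppose that $\beta$, $\gamma$, $\theta$ and $\nu$ satisfy the inequalities

$$1 > \nu > \theta > \beta > \frac{\gamma}{2-\alpha} > 0, \qquad \gamma < \frac{(\nu-\beta)(2-\alpha)\,\varsigma'}{2-\alpha-\varsigma'}. \tag{23}$$

Then the stochastic bounds (19) hold for $n$ large enough and $b$, $k$ in the range $\lfloor n^{\nu}\rfloor \le k \le b \le n$.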
Proof. By definition of the stochastic order, we need to show that for all $m$

$$\mathbb{P}\bigl[J^+_{n\downarrow k} \ge m\bigr] \le \mathbb{P}[J_b \ge m] \le \mathbb{P}\bigl[J^-_{n\downarrow k} \ge m\bigr]. \tag{24}$$

The first inequality in (24) is clearly true for $m \ge \lfloor n^{\beta}\rfloor + 1$ because the left-hand side is zero and for $m \le 2$ because both sides are 1. Suppose $3 \le m \le n^{\beta}$, then the first inequality reads as

$$\lambda_b\,\lambda_k(m:\lfloor n^{\beta}\rfloor)\,(1-n^{-\gamma}) \le \lambda_k\,\lambda_b(m:b). \tag{25}$$

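Since $b \ge k \ge \lfloor n^{\nu}\rfloor$, taking $n$ sufficiently large enables us to apply Lemma 2 and Corollary 3 to get asymptotic estimates valid for all $b$ in the range $k \le b \le n$. From the definition of $\lambda_k(m:\lfloor n^{\beta}\rfloor)$, Lemma 2 and the inequality

$$\frac{\Gamma(m+\alpha+\varsigma'-2)}{\Gamma(m)} \ge \frac{\Gamma(\lfloor n^{\beta}\rfloor+\alpha+\varsigma'-1)}{\Gamma(\lfloor n^{\beta}\rfloor+1)} \qquad\text{for } m \le \lfloor n^{\beta}\rfloor$$

we obtain

$$\lambda_k(m:\lfloor n^{\beta}\rfloor) = \lambda_k(m:k) - \lambda_k(\lfloor n^{\beta}\rfloor+1:k) = \frac{A\alpha}{2-\alpha}\left(\frac{\Gamma(m+\alpha-2)}{\Gamma(m)} - \frac{\Gamma(\lfloor n^{\beta}\rfloor+\alpha-1)}{\Gamma(\lfloor n^{\beta}\rfloor+1)}\right)k^{2-\alpha} + O\!\left(\frac{\Gamma(m+\alpha+\varsigma'-2)}{\Gamma(m)}\,k^{2-\alpha-\varsigma'}\right).$$

Hence we rewrite the inequality (25) as

$$\begin{aligned}
&\left(\frac{A\,\Gamma(\alpha+1)}{2-\alpha}\,b^{2-\alpha} + O(b^{2-\alpha-\varsigma'})\right)
\left(\frac{A\alpha}{2-\alpha}\left(\frac{\Gamma(m+\alpha-2)}{\Gamma(m)} - \frac{\Gamma(\lfloor n^{\beta}\rfloor+\alpha-1)}{\Gamma(\lfloor n^{\beta}\rfloor+1)}\right)k^{2-\alpha} + O\!\left(\frac{\Gamma(m+\alpha+\varsigma'-2)}{\Gamma(m)}\,k^{2-\alpha-\varsigma'}\right)\right)(1-n^{-\gamma}) \\
&\qquad\le \left(\frac{A\,\Gamma(\alpha+1)}{2-\alpha}\,k^{2-\alpha} + O(k^{2-\alpha-\varsigma'})\right)
\left(\frac{A\alpha}{2-\alpha}\,\frac{\Gamma(m+\alpha-2)}{\Gamma(m)}\,b^{2-\alpha} + O\!\left(\frac{\Gamma(m+\alpha+\varsigma'-2)}{\Gamma(m)}\,b^{2-\alpha-\varsigma'}\right)\right).
\end{aligned}$$

The leading terms on both sides cancel, and simplifying this inequality we are reduced to checking

$$O\!\left(\frac{\Gamma(m+\alpha+\varsigma'-2)}{\Gamma(m)}\,k^{2-\alpha-\varsigma'}\,b^{2-\alpha}\right) + O\!\left(\frac{\Gamma(m+\alpha-2)}{\Gamma(m)}\,b^{2-\alpha-\varsigma'}\,k^{2-\alpha}\right)
\le \frac{A^2\alpha\,\Gamma(\alpha+1)}{(2-\alpha)^2}\left(\frac{\Gamma(\lfloor n^{\beta}\rfloor+\alpha-1)}{\Gamma(\lfloor n^{\beta}\rfloor+1)}\,b^{2-\alpha}k^{2-\alpha} + \frac{\Gamma(m+\alpha-2)}{\Gamma(m)}\,b^{2-\alpha}k^{2-\alpha}\,n^{-\gamma}\right).$$

(In the above lines we neglected certain lower order terms using inequalities like $b^{2-\alpha}k^{2-\alpha-\varsigma'} \ge k^{2-\alpha}b^{2-\alpha-\varsigma'}$.) The desired inequality follows from the following two inequalities

$$c_1 \le \frac{c_3\,\Gamma(m)}{n^{\beta(2-\alpha)}\,\Gamma(m+\alpha+\varsigma'-2)}\,k^{\varsigma'} + \frac{\Gamma(m+\alpha-2)}{\Gamma(m+\alpha+\varsigma'-2)}\,k^{\varsigma'}\,n^{-\gamma}, \tag{26}$$

$$c_2 \le \frac{c_3\,\Gamma(m)}{n^{\beta(2-\alpha)}\,\Gamma(m+\alpha-2)}\,b^{\varsigma'} + b^{\varsigma'}\,n^{-\gamma}, \tag{27}$$

with sufficiently large constants $c_1, c_2 > 0$ (twice the ratio of a constant implied by the corresponding $O(\cdot)$ and the constant in the right-hand side is enough) and $c_3 \in\,]0,1[$.

The right-hand side of (26) considered as a function of $m \in [3, \lfloor n^{\beta}\rfloor]$ has a unique minimum which is attained at

$$m' \sim \left(\frac{\varsigma'}{c_3\,(2-\alpha-\varsigma')}\right)^{1/(2-\alpha)} n^{\beta-\gamma/(2-\alpha)}$$

and has the value asymptotic to

$$\left(\frac{c_3\,\Gamma(m')}{n^{\beta(2-\alpha)}\,\Gamma(m'+\alpha+\varsigma'-2)} + \frac{\Gamma(m'+\alpha-2)}{\Gamma(m'+\alpha+\varsigma'-2)}\,n^{-\gamma}\right)k^{\varsigma'} \sim c\,n^{-\beta\varsigma'-\gamma(2-\alpha-\varsigma')/(2-\alpha)}\,k^{\varsigma'}$$

for some $c > 0$. Since $k \ge \lfloor n^{\nu}\rfloor$ the RHS grows to infinity once (23) holds.

In inequality (27) we neglect the first summand in the RHS and still have the function $b^{\varsigma'}n^{-\gamma}$ which grows to infinity with $n$ once (23) holds. Thus the first inequality in (24) holds for all sufficiently large $n$.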

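The second inequality in (24) is obvious for $m \ge \lfloor n^{\beta}\rfloor + 1$ and for $m \le 2$. Suppose $3 \le m \le n^{\beta}$. The inequality can be rewritten as

$$\frac{\lambda_b(m:b)}{\lambda_b} \le \frac{\lambda_n(m:n)}{\lambda_n}\,(1+n^{-\gamma}) + (1-n^{-\gamma})\max_{\ell\in\{k,\dots,n\}}\frac{\lambda_\ell(\lfloor n^{\beta}\rfloor+1:\ell)}{\lambda_\ell} + (1+n^{-\gamma})\left(\max_{\ell\in\{k,\dots,n\}}\frac{\lambda_\ell(\lfloor n^{\beta}\rfloor+1:\ell)}{\lambda_\ell} - \frac{\lambda_n(\lfloor n^{\beta}\rfloor+1:n)}{\lambda_n}\right).$$

The latter follows from a simpler inequality

$$\lambda_n\,\lambda_b(m:b) \le \lambda_b\,\lambda_n(m:n)\,(1+n^{-\gamma}) + \lambda_b\,\lambda_n\,(1-n^{-\gamma})\max_{\ell\in\{k,\dots,n\}}\frac{\lambda_\ell(\lfloor n^{\beta}\rfloor+1:\ell)}{\lambda_\ell}. \tag{28}$$

Since $k \ge \lfloor n^{\nu}\rfloor$, application of (21) implies

$$\max_{\ell\in\{k,\dots,n\}}\frac{\lambda_\ell(\lfloor n^{\beta}\rfloor+1:\ell)}{\lambda_\ell} \ge \frac{c_4}{n^{\beta(2-\alpha)}}$$

for some $c_4 > 0$. We suppose that $n$ is large enough to satisfy $1 - n^{-\gamma} > 1/2$. These observations, Lemma 2 and Corollary 3 allow us to rewrite inequality (28) as

$$\begin{aligned}
&\left(\frac{A\,\Gamma(\alpha+1)}{2-\alpha}\,n^{2-\alpha} + O(n^{2-\alpha-\varsigma'})\right)
\left(\frac{A\alpha}{2-\alpha}\,\frac{\Gamma(m+\alpha-2)}{\Gamma(m)}\,b^{2-\alpha} + O\!\left(\frac{\Gamma(m+\alpha+\varsigma'-2)}{\Gamma(m)}\,b^{2-\alpha-\varsigma'}\right)\right) \\
&\qquad\le \left(\frac{A\,\Gamma(\alpha+1)}{2-\alpha}\,b^{2-\alpha} + O(b^{2-\alpha-\varsigma'})\right)
\left(\frac{A\alpha}{2-\alpha}\,\frac{\Gamma(m+\alpha-2)}{\Gamma(m)}\,n^{2-\alpha} + O\!\left(\frac{\Gamma(m+\alpha+\varsigma'-2)}{\Gamma(m)}\,n^{2-\alpha-\varsigma'}\right)\right)(1+n^{-\gamma}) + c_5\,b^{2-\alpha}\,n^{(1-\beta)(2-\alpha)}
\end{aligned}$$

for some $c_5 > 0$. Simplification shows that this inequality holds provided

$$c_6\,\frac{\Gamma(m+\alpha+\varsigma'-2)}{\Gamma(m)}\,b^{2-\alpha-\varsigma'}\,n^{2-\alpha} \le \frac{\Gamma(m+\alpha-2)}{\Gamma(m)}\,n^{-\gamma}\,b^{2-\alpha}\,n^{2-\alpha} + c_7\,b^{2-\alpha}\,n^{(1-\beta)(2-\alpha)}$$

for suitable constants $c_6, c_7 > 0$. Further simplification gives

$$c_6 \le b^{\varsigma'}\left[\frac{\Gamma(m+\alpha-2)}{\Gamma(m+\alpha+\varsigma'-2)}\,n^{-\gamma} + c_7\,\frac{\Gamma(m)}{\Gamma(m+\alpha+\varsigma'-2)}\,n^{-\beta(2-\alpha)}\right]. \tag{29}$$

Proceeding as above, the expression in brackets attains its minimum in $m \in [3, n^{\beta}]$ at $m'' \sim c_8\,n^{\beta-\gamma/(2-\alpha)}$ ($c_8 > 0$) with the minimum value asymptotic to

$$\frac{\Gamma(m''+\alpha-2)}{\Gamma(m''+\alpha+\varsigma'-2)}\,n^{-\gamma} + c_7\,\frac{\Gamma(m'')}{\Gamma(m''+\alpha+\varsigma'-2)}\,n^{-\beta(2-\alpha)} \sim c_9\,n^{-\beta\varsigma'-\gamma(2-\alpha-\varsigma')/(2-\alpha)},$$

where $c_9 > 0$. Since $b \ge \lfloor n^{\nu}\rfloor$ the right-hand side of (29) grows to infinity as $n \to \infty$ as long as (23) holds. This observation finishes the proof. □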

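We want to keep control over the difference between the distributions of $J^+_{n\downarrow k}$, $J^-_{n\downarrow k}$ and $J_b$, $n \ge b \ge k$. In particular, the following statement provides bounds for divergence of means.

Lemma 8. Suppose $\beta < \nu < 1$. Then there exists $c > 0$ such that for $n$ large enough and for $k$ in range $n \ge k \ge \lfloor n^{\nu}\rfloor$ the following inequalities hold:

$$\bigl|\mathbb{E}[J^-_{n\downarrow k} - 1] - \mathbb{E}[J_n - 1]\bigr| < c\,\max\bigl\{n^{-\gamma},\ n^{-\beta(1-\alpha)},\ n^{\theta-\beta(2-\alpha)},\ n^{1-\theta(2-\alpha)}\bigr\},$$

$$\bigl|\mathbb{E}[J^+_{n\downarrow k} - 1] - \mathbb{E}[J_k - 1]\bigr| < c\,\max\bigl\{n^{-\beta(1-\alpha)},\ n^{-\gamma}\bigr\}.$$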
Proof. We start with the following observation. For 2 < m < b, as 6 — * 00 but m/6 — * 0, 

Jo „■_„, \J/ Jo ,_„, \3J 



j—m j— m 



77?. 







(I — a)T(b + a — l)T(m — I) ' 



where 

^^M^-^L^ as *-0. 



y I - a 

Taking m = In^J +1, for some [3 £ ]0, u[ we see using Corollary [3] that 

as n, fc — > 00 with n > k > \n v \ . 

Now the proof follows by a simple calculation. The mean of J~^ k can be estimated using Lemma|3] 
and (3DJ): 

E r J - _ ll = A n , 2 -rc-TA n (3: KJ) + A»(KJ : n) L ^ J (j - l)A ra , J -(l + n -*>) 

+ 2(Kj- 2 ), 6 - > MKi±^) +2( „ , _ [n . J) MKi±^> 

= E[J„-1] (l + n -, ) _"-^=L^J) + A n (L^J + l:n) 

A n A„ 
_ £ (j - l)An,j +Q ( majc | n fl-/3(2-a) )n l- fl (2-a)^ 
J=|yj+1 

= E[j n -l] +0(max{n^,n-' 3 ( 1 - Q ),77 e -^ 2 - Q ),n 1 - e ( 2 -«)}) . 
Similarly, since $\nu > \beta$ formula (30) is applicable and implies together with Lemma that

n- 3 I 

E r J+ _ 1 i A fc , 2 + n^A fc (3: Kj)+A fc (Kj+l:fc) ^ J (j _ i) Afcj _ ^_ 
L nlk J A fc ^ A fc ^ n > 



E[J fc -l](l -«-**)- £ (J A 1)Afc ' J (l 

= E[J*-1] +0 (max{ )l ^ (1 - a) , J i-''}) 



J=3 

_ A fc (2: KJ) , Afc(KJ+l:fc) 



Afc ' Afc Afc 

j=L« /3 J+i 



so the claim follows. □ 

Using a familiar device, Lemma 7 enables us to couple random variables $J^+_{n\downarrow k}$, $J_b$ and $J^-_{n\downarrow k}$ in such a way that

$$J^+_{n\downarrow k} \le J_b \le J^-_{n\downarrow k} \tag{31}$$

holds almost surely. From this Lemma 5 will follow by comparing $\mathcal{M}_n$ with two random walks.

Proof of Lemma 5. For $n \ge b \ge k \ge 1$ let $g^+(n{\downarrow}k, b)$ and $g^-(n{\downarrow}k, b)$ be the Green kernels of decreasing random walks started at $n$ with decrements $(J^+_{n\downarrow k} - 1)$ and $(J^-_{n\downarrow k} - 1)$, correspondingly. Suppose that (31) holds for some $n > k$ and for all $b$ in range $n \ge b \ge k$. Then we have $g^+(n{\downarrow}k, b) \ge g(n,b) \ge g^-(n{\downarrow}k, b)$ for all such $b$.

Take $\gamma > 0$ and $\nu > \theta > \beta > 1/(2-\alpha)$. Combination of Lemmas 4 and 8 implies that $\mathbb{E}[J^+_{n\downarrow k} - 1] \to 1/(1-\alpha)$ and $\mathbb{E}[J^-_{n\downarrow k} - 1] \to 1/(1-\alpha)$ as $n, k \to \infty$ with $k$ in range $n \ge k \ge \lfloor n^{\nu}\rfloor$.

Assume further that parameters $\beta$, $\gamma$, $\nu$ and $\theta$ are chosen to satisfy inequality (23). Then the coupling (31) exists for $n \ge b \ge k \ge \lfloor n^{\nu}\rfloor$ by Lemma 7. Applying a standard result of renewal theory,

$$g^+(n{\downarrow}k, b) \sim \frac{1}{\mathbb{E}[J^+_{n\downarrow k} - 1]} \to 1-\alpha \quad\text{and}\quad g^-(n{\downarrow}k, b) \sim \frac{1}{\mathbb{E}[J^-_{n\downarrow k} - 1]} \to 1-\alpha$$

as $n, k \to \infty$ and $n - b \to \infty$. Thus $g(n,b)$ has the same limit. □



5 The total number of collisions 

We are now in a position to present our main result on the convergence of the number of collisions $C_n$ in the $\Lambda$-coalescent on $n$ particles.

Theorem 9. Suppose that the measure $\Lambda$ satisfies (2) with $\varsigma > \max\left\{\dfrac{(2-\alpha)^2}{5-5\alpha+\alpha^2},\ 1-\alpha\right\}$. Then, as $n \to \infty$, we have the convergence in distribution

$$\frac{C_n - (1-\alpha)n}{(1-\alpha)\,n^{1/(2-\alpha)}} \to_d S_{2-\alpha}$$

to a stable random variable $S_{2-\alpha}$ with the characteristic function

$$\mathbb{E}\bigl[e^{iuS_{2-\alpha}}\bigr] = \exp\bigl(-e^{-i\pi\alpha\,\mathrm{sign}(u)/2}\,|u|^{2-\alpha}\bigr). \tag{32}$$

We emphasize that $\varsigma = 1$ satisfies assumptions of the above Theorem for all $\alpha \in\,]0,1[$. This is important because $\varsigma = 1$ appears, say, if $\Lambda$ is a beta-measure.

Remark. The characteristic function (32) is not a canonic form for the characteristic function of a stable distribution. There are several commonly used parametrisations for stable variables, see [15, 18]; the difference between them being a frequent source of confusion. Apparently the most common parametrisation involves the index of stability $\alpha \in\,]0,2]$, the skewness $\beta \in [-1,1]$, the scale $\sigma > 0$ and the location $\mu \in \mathbb{R}$, so that a random variable $S$ has the stable distribution with parameters $(\alpha, \beta, \sigma, \mu)$ iff

$$\mathbb{E}\bigl[e^{iuS}\bigr] = \begin{cases}
\exp\bigl(-\sigma^{\alpha}|u|^{\alpha}\bigl(1 - i\beta\tan\frac{\pi\alpha}{2}\,\mathrm{sign}\,u\bigr) + i\mu u\bigr), & \alpha \ne 1,\\[1ex]
\exp\bigl(-\sigma|u|\bigl(1 + \frac{2i\beta}{\pi}(\mathrm{sign}\,u)\log|u|\bigr) + i\mu u\bigr), & \alpha = 1.
\end{cases}$$

In this parametrisation our random variable $S_{2-\alpha}$ has $\bigl(2-\alpha,\ -1,\ (\cos\frac{\pi\alpha}{2})^{1/(2-\alpha)},\ 0\bigr)$-stable distribution since its characteristic function can be rewritten as

$$\exp\bigl(-e^{-i\pi\alpha\,\mathrm{sign}(u)/2}|u|^{2-\alpha}\bigr) = \exp\Bigl(-\cos\tfrac{\pi\alpha}{2}\,|u|^{2-\alpha}\bigl(1 - i\tan\tfrac{\pi\alpha}{2}\,\mathrm{sign}\,u\bigr)\Bigr) = \exp\Bigl(-\cos\tfrac{\pi\alpha}{2}\,|u|^{2-\alpha}\bigl(1 + i\tan\tfrac{\pi(2-\alpha)}{2}\,\mathrm{sign}\,u\bigr)\Bigr).$$

Thus $S_{2-\alpha}$ has $(2-\alpha)$-stable distribution totally skewed to the left.
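The identification of parameters in the Remark can be confirmed numerically by evaluating both forms of the characteristic function at a few points. The sketch below uses only the formulas stated above; the function names `cf_paper` and `cf_standard` are hypothetical.

```python
import cmath
from math import cos, tan, pi, copysign

def cf_paper(u, alpha):
    """Characteristic function (32): exp(-exp(-i*pi*alpha*sign(u)/2) * |u|^(2-alpha))."""
    if u == 0:
        return 1 + 0j
    sgn = copysign(1.0, u)
    return cmath.exp(-cmath.exp(-1j * pi * alpha * sgn / 2) * abs(u) ** (2 - alpha))

def cf_standard(u, index, skew, scale, loc):
    """Standard (index, skew, scale, loc) parametrisation, valid for index != 1."""
    if u == 0:
        return 1 + 0j
    sgn = copysign(1.0, u)
    return cmath.exp(-(scale * abs(u)) ** index
                     * (1 - 1j * skew * tan(pi * index / 2) * sgn)
                     + 1j * loc * u)

def max_discrepancy(alpha, points=(-3.0, -0.5, 0.7, 2.0)):
    """Largest absolute difference between the two forms over the test points."""
    scale = cos(pi * alpha / 2) ** (1 / (2 - alpha))
    return max(abs(cf_paper(u, alpha) - cf_standard(u, 2 - alpha, -1.0, scale, 0.0))
               for u in points)

if __name__ == "__main__":
    for alpha in (0.2, 0.5, 0.8):
        print(alpha, max_discrepancy(alpha))
```

The discrepancies should be at the level of rounding error, which is consistent with skewness $-1$ and scale $(\cos\frac{\pi\alpha}{2})^{1/(2-\alpha)}$.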

The main idea of the proof is that the decrements $J_b$ of $\mathcal{M}_n$ are almost identically distributed for large $b$, as Corollary 3 suggests. However the nonstationarity prevents the possibility of any direct analysis. To overcome this, we use the technique of stochastic bounds introduced in the previous section. First we introduce some auxiliary notation.

For $1 \le k < n$ the coalescent started with $n$ particles after some series of collisions will reach a state with fewer than $k+1$ particles; let $C_{n\downarrow k}$ denote the number of collisions and let $B_{n\downarrow k} \le k$ denote the number of particles as the coalescent enters such a state. In particular, $C_n = C_{n\downarrow 1}$. For independent copies $J^{\pm}_{n\downarrow k,m}$ of $J^{\pm}_{n\downarrow k}$, introduce

$$C^+_{n\downarrow k,\ell} := \min\Bigl\{c : \sum_{m=1}^{c}\bigl(J^+_{n\downarrow k,m} - 1\bigr) \ge \ell\Bigr\}, \qquad C^-_{n\downarrow k,\ell} := \min\Bigl\{c : \sum_{m=1}^{c}\bigl(J^-_{n\downarrow k,m} - 1\bigr) \ge \ell\Bigr\}, \tag{33}$$

the minimal number of decrements distributed as $J^+_{n\downarrow k} - 1$ (respectively, $J^-_{n\downarrow k} - 1$) needed to drop by at least $\ell$. We skip the index $\ell$ when it is equal to $n-k$, so that $C^{\pm}_{n\downarrow k} = C^{\pm}_{n\downarrow k,n-k}$.

Under assumptions of Lemma 7 we can couple the corresponding Markov chains so that (31) holds almost surely for all large enough $n$ once $n \ge b \ge k \ge \lfloor n^{\nu}\rfloor$. Consequently for such $n$ the coupled Markov chains satisfy

$$C^+_{n\downarrow\lfloor n^{\nu}\rfloor} \ge C_{n\downarrow\lfloor n^{\nu}\rfloor} \ge C^-_{n\downarrow\lfloor n^{\nu}\rfloor}.$$

In other words,

$$C^+_{n\downarrow\lfloor n^{\nu}\rfloor} \ge_d C_{n\downarrow\lfloor n^{\nu}\rfloor} \ge_d C^-_{n\downarrow\lfloor n^{\nu}\rfloor}. \tag{34}$$

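In order to find the limit distributions for $C^{\pm}_{n\downarrow\lfloor n^{\nu}\rfloor}$ we need the following statement about the characteristic function

$$\varphi_n(u) := \mathbb{E}\bigl[e^{iu(J_n-1)}\bigr]$$

of the first decrement of $\mathcal{M}_n$.

Lemma 10. Let $\Lambda$ satisfy (2) with $\varsigma > 1-\alpha$. Then there exists $\delta > 0$ such that

$$\varphi_n(s/m) = 1 + \frac{is}{(1-\alpha)m} - \frac{\omega(s)\,|s|^{2-\alpha}}{(1-\alpha)\,m^{2-\alpha}} + O\bigl(m^{\alpha-2-\delta}\bigr),$$

as $n, m \to \infty$ with $m < n^{\nu}$ for some $\nu < 1$, where $\omega(s) = e^{i\pi\alpha\,\mathrm{sign}(s)/2}$.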



Proof. We write for shorthand $u = s/m$. For $u = 0$ the claim is obvious, so we suppose that $u \ne 0$. The characteristic function of $J_n - 1$ can be written in terms of $\Lambda$ as follows:

$$\varphi_n(u) = e^{-iu}\sum_{j=2}^{n} e^{iuj}\,\frac{\lambda_{n,j}}{\lambda_n} = \frac{e^{-iu}}{\lambda_n}\int_0^1 \frac{\bigl(1-(1-e^{iu})x\bigr)^n - (1-x)^n - nxe^{iu}(1-x)^{n-1}}{x^2}\,\Lambda(dx)$$

using the integral representation of $\lambda_{n,j}$. Denote the numerator of the fraction under the integral above by $h_n(u,x)$; then

$$h_{n+1}(u,x) - h_n(u,x) = x(1-e^{iu})\Bigl((1-x)^n - \bigl(1-(1-e^{iu})x\bigr)^n\Bigr) + x^2 n(1-x)^{n-1}e^{iu},$$

so using (7) we obtain

$$\varphi_n(u) = 1 - \frac{1-e^{-iu}}{\lambda_n}\sum_{j=1}^{n-1}\int_0^1 \frac{(1-x)^j - \bigl(1-(1-e^{iu})x\bigr)^j}{x}\,\Lambda(dx) \tag{35}$$

because $h_1(u,x) = 0$. Taking again differences of $(1-x)^j - (1-(1-e^{iu})x)^j$ with respect to $j$ and calculating it directly for $j = 0$ we represent the integral in (35) as

$$(1-e^{iu})\sum_{k=0}^{j-1}\int_0^1 \bigl(1-(1-e^{iu})x\bigr)^k\,\Lambda(dx) - \sum_{k=0}^{j-1}\int_0^1 (1-x)^k\,\Lambda(dx).$$

Exchanging the sums and utilising notation (5) for moments $\nu_k$ of $\Lambda$ we get

$$\varphi_n(u) = 1 + \frac{1-e^{-iu}}{\lambda_n}\sum_{k=0}^{n-2}(n-k-1)\,\nu_k + \frac{(1-e^{iu})^2 e^{-iu}}{\lambda_n}\sum_{k=0}^{n-2}(n-k-1)\int_0^1 \bigl(1-(1-e^{iu})x\bigr)^k\,\Lambda(dx). \tag{36}$$

By (8) and Lemma 4 the second term above is

$$\frac{1-e^{-iu}}{\lambda_n}\sum_{k=0}^{n-2}(n-k-1)\,\nu_k = (1-e^{-iu})\,\mathbb{E}[J_n - 1] = \frac{1-e^{-iu}}{1-\alpha}\bigl(1 + O(n^{\alpha-1})\bigr)$$

since $\varsigma > 1-\alpha$ by hypothesis. Recalling notation $u = s/m$ and inequality $n > m^{1/\nu}$ with $\nu < 1$ we see that

$$\frac{1-e^{-is/m}}{\lambda_n}\sum_{k=0}^{n-2}(n-k-1)\,\nu_k = \frac{is}{(1-\alpha)m} + O\bigl(m^{\alpha-2-\delta_1}\bigr)$$

for some $\delta_1 > 0$. Thus it remains to estimate the last summand in (36).
Integration by parts gives

$$\int_0^1 \bigl(1-(1-e^{iu})x\bigr)^k\,\Lambda(dx) = e^{iku} + k(1-e^{iu})\int_0^1 \bigl(1-(1-e^{iu})x\bigr)^{k-1}\Lambda[0,x]\,dx.$$

Substitution of this relation into (36) leads to

$$\begin{aligned}
\varphi_n(s/m) = 1 &+ \frac{is}{(1-\alpha)m} + \frac{(1-e^{is/m})^2 e^{-is/m}}{\lambda_n}\sum_{k=0}^{n-2}(n-k-1)\,e^{iks/m} \\
&+ \frac{(1-e^{is/m})^3 e^{-is/m}}{\lambda_n}\sum_{k=0}^{n-2}k(n-k-1)\int_0^1 \bigl(1-(1-e^{is/m})x\bigr)^{k-1}\Lambda[0,x]\,dx + O\bigl(m^{\alpha-2-\delta_1}\bigr).
\end{aligned} \tag{37}$$

Summation yields

$$\frac{(1-e^{is/m})^2 e^{-is/m}}{\lambda_n}\sum_{k=0}^{n-2}(n-k-1)\,e^{iks/m} = \frac{e^{-is/m}\bigl(n(1-e^{is/m}) + e^{isn/m} - 1\bigr)}{\lambda_n}.$$

For $m$ big enough and $n > m^{1/\nu}$ with $\nu < 1$

$$\left|\frac{n(1-e^{is/m}) + e^{isn/m} - 1}{\lambda_n}\right| \le \frac{n\,|1-e^{is/m}| + |e^{isn/m} - 1|}{\lambda_n} \le \frac{\mathrm{const}}{n^{1-\alpha}\,m} = O\bigl(m^{\alpha-2-\delta_2}\bigr)$$

for some $\delta_2 > 0$.

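Let $\theta \in [-\pi/2, \pi/2]$ be such that $e^{i\theta} = (1-e^{is/m})/|1-e^{is/m}|$. Note that $\theta = -\pi\,\mathrm{sign}(s)/2 + O(1/m)$ as $m \to \infty$. For any $\beta > 0$ we have

$$k(1-e^{is/m})\int_0^1 \bigl(1-(1-e^{is/m})x\bigr)^{k-1}x^{\beta}\,dx = \frac{e^{i\theta}}{\bigl(k\,|1-e^{is/m}|\bigr)^{\beta}}\int_0^{k|1-e^{is/m}|}\bigl(1 - e^{i\theta}t/k\bigr)^{k-1}t^{\beta}\,dt = \frac{e^{i\pi\beta\,\mathrm{sign}(s)/2}}{(k|s|/m)^{\beta}}\,\Gamma(\beta+1)\bigl(1 + O(1/m)\bigr)$$

as $m, k \to \infty$ with $k > m^{1+\delta_3}$ for any $\delta_3 > 0$. By assumption (2) we can write $\Lambda[0,x] = Ax^{\alpha} + f(x)$ where $|f(x)| < cx^{\alpha+\varsigma}$ for some $c > 0$ and all $x \in [0,1]$. Thus, as $m, k \to \infty$ with $k > m^{1+\delta_3}$,

$$k(1-e^{is/m})\int_0^1 \bigl(1-(1-e^{is/m})x\bigr)^{k-1}\Lambda[0,x]\,dx = \frac{A\,e^{i\pi\alpha\,\mathrm{sign}(s)/2}}{(k|s|/m)^{\alpha}}\,\Gamma(\alpha+1)\bigl(1 + O(1/m)\bigr) + O\bigl((m/k)^{\alpha+\varsigma}\bigr).$$

Take $\delta_3 = (1/\nu - 1)/2$ and denote $n_0 = \lfloor m^{1+\delta_3}\rfloor$. Divide the last sum in (37) into two sums over $k \ge n_0$ and $k < n_0$. The first sum is estimated taking (17) into account as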



(1 - e is /m)2 

An 



n-2 „i 

V (n - k - 1) / fc(l - e is/m )(l - (1 - e is/m )a;)' c - 1 A[0, a?] dx 
,. _ Jo 



fc-no 



|.|W«»-^W/»(2-oO g (n _ fc)fc - 



fc— no 



n-2 



n-2 



1 x ^ n — k 1 x ^ 

— a^2 — a — / ; k a ^^ rl 2 — a rn 3 — a ' > 
\ k—nQ 

| s |2-a e i7r Q! sign( S )/2^2_ Q ,) 



n — k 



n^~"m"~" ' — ' k c 

k—no 



2-(5, 



aT°(l - + O -: T — r + 



3-Q 



I „|2-a i7ra sign(s)/2 

= - n ^ 5=5 + 0(m«- a - a *) 
(1 — ajm z a 

for some $\delta_4 > 0$. The same argument applied to the sum over $k = 0, \dots, n_0 - 1$ shows that it constitutes a lower order term to the whole sum. Thus it remains to combine the results above to get the statement of the Lemma. □

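Next we show that under certain assumptions the same asymptotic expansion is also valid for the characteristic functions of $J^{\pm}_{n\downarrow k} - 1$,

$$\varphi^+_{n\downarrow k}(u) := \mathbb{E}\bigl[e^{iu(J^+_{n\downarrow k}-1)}\bigr], \qquad \varphi^-_{n\downarrow k}(u) := \mathbb{E}\bigl[e^{iu(J^-_{n\downarrow k}-1)}\bigr].$$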



Lemma 11. Suppose (|2J) holds with q > 1 — a and that the parameters in (|20[) satisfy inequalities 



1 — a 

2 — a 

Then there exists 5 > such that 



and v > 9 > (3 > 



2 - a 



(38) 



^ Mm) = 1 + — V - ( f S)l fZ a + O (m--') , 
(1 — CKjm (1 — ct)m a v y 

(«/"») = 1 + n ~~r - ,f s)l f7 a + ° (™ a - 2 - s ) , 

™ u w 7 (1 - a)m (1 - a)m 2 -« v ' 

as n,k,m — > 00 m swc/i a way </iai n > k > [n v \ and m < cn 1 ^ 2- ") /or some c > 0. 
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Proof. The characteristic function of J n ^ k — 1 is by definition 

, iu /A n , 2 -n-TA n (3: KJ)+A„(KJ + l:n) A*(KJ+1:£) 
^nifeW = e — ; 2 f? ax 



U V ^" «e{k,...,n} Af 

+ V ^■( 1 + "" 7 ) e ib--i) tt + 2e KLn'j-i) U max ML" g J + i = 

^ A„ £e{fc,...,n} X e 



+ 2 ^e 1 ^ 1-1 ^" — eid™"]- 1 )"^ 

We rewrite it as follows: 



X e ([n 9 \ + 1 : £) 

max — — — =■ - 

ee{k,...,n} X( 



i — / \ , , W1 , _ 7 n - 7 i„ , A„(|yj + l:n)(l + n-T) 



17/ 



V (1 + n-^j eltf _ 1)w + 2 / e i (L ,»j-i)u _ ^ max MKJ+1:^) 
j= fe X n \ ) mkl ... M X e 



+ 2 / e i(n-l)-_ e t(Ln'j-l)^ max M^lM. 

V / te{k,...,n} Xe 

Four last summands are of the order of n~^ 2 ~ a ' by Lemma [2j From (|38| . /3 > 1/(2 — a) and so 
the bound m < cn 1 ^ 2- °^ guarantees that these four summands constitute 0(m a ~ 2 ~ Sl ) terms to 
the whole sum, for 8i, 5%, . . . some positive constants. The same bound on m allows application of 
Lemma [TU] for (j) n (s/m) which leads to 

r nik {s/m) = (l+ -A- - ( f S)l f7- a + 0(m a - 2 - 5 * )) (1 + n^) 
1 \ (1 — a)m (1 — a)m z a J 

1 + — + 0(m- 2 )] + 0{m a - 2 - Sl ) 



in 



= i + tt^t - ,r (s) J£;i + (t~~~~ ~ i ) ~~~~~ + qk^) 



(1 — a) 77i (1 — a)m 2 a \1 — a 

and the claim about <^~i£ follows from inequality 7 > (1 — ot)/(2 — a). 
Analogously, 

,+ . ,_ A fc , 2 + n^A fc (3:Kj) + A fc (L^J+l:fc) lu L ^ J A fcj (1 - n'^e^- 1 )" 
J_ A fc A fc 

= ^£ (1 _ n -7) eif + n -7 e iu + A fc (L^J+l:b) (1 _ n _ 7) giu 
Afc Afc 

^ A &J (l-n-T)^ i(j ._ 1)u ^ A fcJ (l ~ n-i) ^ {j _ 1)u 



fc (u) (1 - n-T) + n-V" + (1 - n^) £ fejl ~ ' ~ " " 



-e 



Afc 
j=Lr^J+l 

Since $k>\lfloor n^{\nu}\rfloor$ with $\nu>\beta>1/(2-a)$ and $m<cn^{1/(2-a)}$, Lemma [5] ensures that the last sum
above is $O(n^{-(2-a)\beta})=O(m^{a-2-\delta_1})$. Since $m$ grows slower than $k$ by hypothesis, Lemma [10] can
be applied to $\phi_k(s/m)$ and the claim again follows from the inequality $\gamma>(1-a)/(2-a)$. □
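The passage from a power of $n$ to a power of $m$ in the last step can be checked directly. This is a sketch under the assumptions $m<cn^{1/(2-a)}$ and $\beta>1/(2-a)$ quoted in the text; the constant $C$ and the explicit value of $\delta$ are illustrative.

```latex
% Assumptions: m < c n^{1/(2-a)} and \beta > 1/(2-a); C depends only on c, a, \beta.
\[
n^{-(2-a)\beta} \le C\, m^{-(2-a)^2\beta}, \qquad
(2-a)^2\beta > 2-a \iff \beta > \frac{1}{2-a},
\]
% so the sum is O(m^{a-2-\delta}) with \delta = (2-a)\bigl((2-a)\beta-1\bigr) > 0.
```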

Lemma 12. Suppose (2) holds with $\varsigma>1-a$ and that the parameters in (20) satisfy the inequalities

1 — a 5 — 5a + a 2 , . 1 — a 3 — 2a , . 

7>- , l >v> e>p>— — , 0(2 -a)-- >0> To rj. 39 
Let $S^{+}_{n,k,h}$, respectively $S^{-}_{n,k,h}$, be the sum of $h$ independent copies of $J^{+}_{n,k}-1$, respectively $J^{-}_{n,k}-1$.
Then
\[
\frac{S^{+}_{n,k,h}-h/(1-a)}{(h/(1-a))^{1/(2-a)}}\to_d \mathcal{S}_{2-a},\qquad
\frac{S^{-}_{n,k,h}-h/(1-a)}{(h/(1-a))^{1/(2-a)}}\to_d \mathcal{S}_{2-a}
\]

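as $n,h\to\infty$ with $h=O(n)$ and $n>k>\lfloor n^{\nu}\rfloor$, $1>\nu>\beta$, where $\mathcal{S}_{2-a}$ is a stable random variable
with the characteristic function
\[
E\left[e^{iu\mathcal{S}_{2-a}}\right]=\exp\left[-\omega(u)|u|^{2-a}\right],\qquad
\omega(u)=\exp\left[\frac{i\pi a\,\operatorname{sign}u}{2}\right]. \tag{40}
\]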



Proof. First note that a solution of the inequalities (39) always exists. Since (38) follows from (39), the
bound $h=O(n)$ guarantees that Lemma [11] is applicable with $m=(h/(1-a))^{1/(2-a)}$. Lemmas [4]
and [H] provide tough bounds for E [J^ik ~ l] • Namely, inequality (|3l?)) implies that 



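\[
E\left[J^{-}_{n,k}-1\right]=\frac{1}{1-a}+O\left(n^{-(1-a)/(2-a)-\delta_1}\right) \tag{41}
\]
for some $\delta_1>0$. Hence
\[
E\exp\left[is\,\frac{J^{-}_{n,k}-1-E\left[J^{-}_{n,k}-1\right]}{(h/(1-a))^{1/(2-a)}}\right]
=1-\frac{\omega(s)|s|^{2-a}}{h}+O\left(h^{-1-\delta_2}\right)
\]
for some $\delta_2>0$. Moreover, equation (41) and $h=O(n)$ imply
\[
\frac{E\left[S^{-}_{n,k,h}\right]-h/(1-a)}{(h/(1-a))^{1/(2-a)}}
=O\left(h^{1-1/(2-a)}\,n^{-(1-a)/(2-a)-\delta_1}\right)=O\left(n^{-\delta_1}\right)
\]
as $n,h\to\infty$. Hence, for some $\delta_3>0$,
\[
\begin{aligned}
E\exp\left[is\,\frac{S^{-}_{n,k,h}-h/(1-a)}{(h/(1-a))^{1/(2-a)}}\right]
&=\exp\left[is\,\frac{E\left[S^{-}_{n,k,h}\right]-h/(1-a)}{(h/(1-a))^{1/(2-a)}}\right]
E\exp\left[is\,\frac{S^{-}_{n,k,h}-E\left[S^{-}_{n,k,h}\right]}{(h/(1-a))^{1/(2-a)}}\right] \\
&=\left(1+O\left(n^{-\delta_3}\right)\right)\left(1-\frac{\omega(s)|s|^{2-a}}{h}+O\left(h^{-1-\delta_2}\right)\right)^{h}
\to\exp\left[-\omega(s)|s|^{2-a}\right]
\end{aligned}
\]
and the claim about $S^{-}_{n,k,h}$ follows.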



Treatment of the limit theorem for $S^{+}_{n,k,h}$ literally repeats the above steps and is omitted. □
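Two elementary facts underlie the final limit in this proof, and it may help to see them written out. The sketch below assumes the scaling $m=(h/(1-a))^{1/(2-a)}$ used in the proof; the shorthand $z$ and the generic exponent $\delta>0$ are introduced here only for illustration.

```latex
% With m = (h/(1-a))^{1/(2-a)} one has m^{2-a} = h/(1-a), so
\[
\frac{\omega(s)|s|^{2-a}}{(1-a)m^{2-a}}=\frac{\omega(s)|s|^{2-a}}{h}.
\]
% For fixed s put z = -\omega(s)|s|^{2-a}. Using \log(1+w) = w + O(w^2) for small complex w,
\[
h\log\left(1+\frac{z}{h}+O\left(h^{-1-\delta}\right)\right)
= z+O\left(h^{-\min(\delta,1)}\right)\longrightarrow z,\qquad h\to\infty,
\]
% hence the h-th power tends to e^{z} = \exp[-\omega(s)|s|^{2-a}], and pointwise
% convergence of characteristic functions gives convergence in distribution.
```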

Let $F_{2-a}(\cdot)$ be the distribution function of the stable random variable $S_{2-a}$ defined by (32) and
$\mathcal{F}_{2-a}(\cdot)$ be that of $\mathcal{S}_{2-a}$ defined by (40). Note that the random variables $\mathcal{S}_{2-a}$ and $-S_{2-a}$ have the
same distribution, i.e. $F_{2-a}(t)=1-\mathcal{F}_{2-a}(-t)$.
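The identity between the two distribution functions follows in one line. In this sketch $F_{2-a}$ denotes the distribution function of the variable $S_{2-a}$ from (32) and $\mathcal{F}_{2-a}$ that of the variable $\mathcal{S}_{2-a}$ from (40); the only inputs are that $\mathcal{S}_{2-a}$ has the law of $-S_{2-a}$ and that stable laws have no atoms.

```latex
\[
F_{2-a}(t)=P\left[S_{2-a}\le t\right]=P\left[-\mathcal{S}_{2-a}\le t\right]
=P\left[\mathcal{S}_{2-a}\ge -t\right]
=1-P\left[\mathcal{S}_{2-a}<-t\right]=1-\mathcal{F}_{2-a}(-t),
\]
% the last step uses continuity of \mathcal{F}_{2-a}, i.e. P[\mathcal{S}_{2-a} = -t] = 0.
```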

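Lemma 13. Let the measure $\Lambda$ satisfy (2) with $\varsigma>\max\left\{\frac{(2-a)^2}{5-5a+a^2},\,1-a\right\}$. Then there exists $\nu<1$
such that
\[
\frac{C_{n|\lfloor n^{\nu}\rfloor}-\left(n-\lfloor n^{\nu}\rfloor\right)(1-a)}{n^{1/(2-a)}(1-a)}\to_d S_{2-a}\quad\text{as } n\to\infty.
\]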



Proof. Suppose that $\beta$, $\gamma$ and $\theta$ satisfy both inequalities (23) and (39). This is always possible if $\varsigma$
satisfies the condition stated in the Lemma. Indeed, the only constraints which can become inconsistent
by joining inequalities (23) and (39) are the constraints on $\gamma$
\[
\frac{(\nu-\beta)(2-a)\varsigma'}{2-a-\varsigma'}>\gamma>\frac{1-a}{2-a}. \tag{42}
\]
By (39) we can choose $\beta$ and $\nu$ such that $\nu-\beta<1-\frac{5-5a+a^2}{(2-a)^3}$. Hence (42) is solvable for
\[
\left(1-\frac{5-5a+a^2}{(2-a)^3}\right)\frac{(2-a)\varsigma'}{2-a-\varsigma'}>\frac{1-a}{2-a}.
\]
Resolving $\varsigma'$ from this inequality and recalling its definition leads to the lower bound on $\varsigma$ in the
claim.
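The algebra behind the threshold $(2-a)^2/(5-5a+a^2)$ is elementary and can be verified as follows. This is a sketch: $x$ is a placeholder (an assumption of this illustration) for the exponent entering (42), assumed to satisfy $0<x<2-a$, and $0<a<1$ as in the text.

```latex
% Factorisation of the coefficient:
\[
1-\frac{5-5a+a^2}{(2-a)^3}=\frac{(2-a)^3-(5-5a+a^2)}{(2-a)^3}
=\frac{3-7a+5a^2-a^3}{(2-a)^3}=\frac{(1-a)^2(3-a)}{(2-a)^3}.
\]
% Solving the inequality for x (both 2-a-x and 5-5a+a^2 are positive here):
\[
\frac{(1-a)^2(3-a)}{(2-a)^3}\cdot\frac{(2-a)x}{2-a-x}>\frac{1-a}{2-a}
\iff (1-a)(3-a)\,x>(2-a)(2-a-x)
\iff x>\frac{(2-a)^2}{5-5a+a^2}.
\]
```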

By definition, $C^{+}_{n|\lfloor n^{\nu}\rfloor,d}$ is the random number of decrements $J^{+}_{n,\lfloor n^{\nu}\rfloor}-1$ needed to make a total
move larger than $d$. Hence for all $h>0$



C 1 1 vi ,<h 

ni\n v \, d — 



(43) 



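Take now $d=n-\lfloor n^{\nu}\rfloor$ and $h=\left\lfloor\left(n-\lfloor n^{\nu}\rfloor+t_n n^{1/(2-a)}\right)(1-a)\right\rfloor$, where $t_n\to t$ as $n\to\infty$. Then
$h\to\infty$ but $h=O(n)$ as $n\to\infty$. Moreover,
\[
d=n-\lfloor n^{\nu}\rfloor=\frac{h}{1-a}-t\left(\frac{h}{1-a}\right)^{1/(2-a)}(1+o(1)),\qquad n\to\infty,
\]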



and application of Lemma [12] ensures that the RHS of (43) converges to $\mathcal{F}_{2-a}(-t)$ as $n\to\infty$, since
$\mathcal{F}_{2-a}$ is continuous [18]. Thus



n l/(2-a)(! _ a ) 



< t 



C+ vl <h 

n|L™ J — 



si 



> 



- 1 



l/(2-a) 



(l + o(l)) 



1- F 2 _ a (-t) =F 2 _ a (t). 



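Replacing $S^{+}_{n,\lfloor n^{\nu}\rfloor,h}$ with $S^{-}_{n,\lfloor n^{\nu}\rfloor,h}$ and $C^{+}_{n|\lfloor n^{\nu}\rfloor,d}$ with $C^{-}_{n|\lfloor n^{\nu}\rfloor,d}$ in the above argument we obtain
\[
P\left[\frac{C^{-}_{n|\lfloor n^{\nu}\rfloor,d}-\left(n-\lfloor n^{\nu}\rfloor\right)(1-a)}{n^{1/(2-a)}(1-a)}<t\right]\to F_{2-a}(t).
\]
Hence the claim follows from inequalities (34) since $F_{2-a}$ is continuous.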



□ 



Proof of Theorem [9]. Recall that $B_{n,k}$ is the number of particles in the $\Lambda$-coalescent started with $n$
particles right after the number of particles drops below $k+1$. Then for any $k<n$ the total
number of collisions is decomposable as



Cn — d C n 



where in the RHS (C^) is an independent copy of (Cfe). This can be iterated as 



C n - C nlkl + C ( gl kilk2 + C^ k2lk3 H h ^ Bnikf l ik e 
for any finite sequence $k_\ell<k_{\ell-1}<\dots<k_1<n$, with the convention that $C_{b|k}=0$ for $b<k$.



-,(2) 



(44) 



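Suppose that Lemma [12] holds for some $\nu<1$. Let $\ell=\left\lfloor\frac{-\log(2-a)}{\log\nu}\right\rfloor$, then $\nu^{\ell+1}<1/(2-a)$.
For each $m$, by Lemma [13] applied with $\lfloor n^{\nu^{m}}\rfloor$ instead of $n$,
\[
P\left[\frac{C_{\lfloor n^{\nu^{m}}\rfloor|\lfloor n^{\nu^{m+1}}\rfloor}-\left(\lfloor n^{\nu^{m}}\rfloor-\lfloor n^{\nu^{m+1}}\rfloor\right)(1-a)}{n^{\nu^{m}/(2-a)}(1-a)}<t\right]\to F_{2-a}(t).
\]
Consequently, since $\nu<1$, for all $m>0$
\[
\frac{C_{\lfloor n^{\nu^{m}}\rfloor|\lfloor n^{\nu^{m+1}}\rfloor}-\left(\lfloor n^{\nu^{m}}\rfloor-\lfloor n^{\nu^{m+1}}\rfloor\right)(1-a)}{n^{1/(2-a)}(1-a)}\to_d\delta_0,\qquad n\to\infty,
\]
where $\delta_0$ is the $\delta$-measure at zero. Moreover, for some fixed $\tau\in\,]0,1/(2-a)[$ starting the coalescent
with $\lfloor n^{\nu^{m}}-n^{\tau}\rfloor$ particles instead of $\lfloor n^{\nu^{m}}\rfloor$ particles results in the same asymptotic behaviour:
\[
\frac{C_{\lfloor n^{\nu^{m}}-n^{\tau}\rfloor|\lfloor n^{\nu^{m+1}}\rfloor}-\left(\lfloor n^{\nu^{m}}\rfloor-\lfloor n^{\nu^{m+1}}\rfloor\right)(1-a)}{n^{1/(2-a)}(1-a)}\to_d\delta_0,\qquad n\to\infty.
\]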

Denote by $E_m$ the event $B_{n,\lfloor n^{\nu^{m}}\rfloor}>n^{\nu^{m}}-n^{\tau}$, i.e. that the Markov process $M_n$ undershoots $n^{\nu^{m}}$
by not more than $n^{\tau}$. Thus, given $E_m$,
\[
\frac{C_{B_{n,\lfloor n^{\nu^{m}}\rfloor}|\lfloor n^{\nu^{m+1}}\rfloor}-\left(\lfloor n^{\nu^{m}}\rfloor-\lfloor n^{\nu^{m+1}}\rfloor\right)(1-a)}{n^{1/(2-a)}(1-a)}\to_d\delta_0,\qquad n\to\infty,
\]
by monotonicity of the number of collisions. Using (44) with $k_m=\lfloor n^{\nu^{m}}\rfloor$ and Lemma [13] yields the
desired convergence for $C_n$, given that $E_m$ holds for all $m=1,\dots,\ell$.

Lemma [2] ensures that for any $\tau>0$ the probability of $E_m$ grows to 1, as $n\to\infty$, for all
$m=1,\dots,\ell$. Hence $P\left[\bigcap_{m=1}^{\ell}E_m\right]\to1$ and the convergence in distribution conditioned on $\bigcap_{m=1}^{\ell}E_m$
is equivalent to the unconditional convergence in distribution. □
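Two computations used implicitly in this proof can be spelled out. The following is a sketch: the shorthand $L$, the convention $k_0=n$ and the summation range $m=0,\dots,\ell$ are assumptions made here for illustration and are not notation taken from the text.

```latex
% Choice of \ell: put L = -\log(2-a)/\log\nu > 0 (note \log\nu < 0) and \ell = \lfloor L\rfloor. Then
\[
\ell+1>L\ \Longrightarrow\ \nu^{\ell+1}<\nu^{L}=\exp\left(-\log(2-a)\right)=\frac{1}{2-a}.
\]
% Telescoping of the centering terms over the levels k_m = \lfloor n^{\nu^m}\rfloor, with k_0 = n:
\[
\sum_{m=0}^{\ell}\left(\lfloor n^{\nu^{m}}\rfloor-\lfloor n^{\nu^{m+1}}\rfloor\right)(1-a)
=\left(n-\lfloor n^{\nu^{\ell+1}}\rfloor\right)(1-a)
=(1-a)\,n+o\left(n^{1/(2-a)}\right),
\]
% because n^{\nu^{\ell+1}} = o(n^{1/(2-a)}) when \nu^{\ell+1} < 1/(2-a).
```

In words, the levels $m\ge1$ contribute only to the centering $(1-a)n$, while the stable fluctuation of order $n^{1/(2-a)}$ comes from the first stage alone.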